Consistent frequency-based sound matches to natural visual scenes
نویسندگان
چکیده
منابع مشابه
Learning to Localize Sound Source in Visual Scenes
Visual events are usually accompanied by sounds in our daily lives. We pose the question: Can the machine learn the correspondence between visual scene and the sound, and localize the sound source only by observing sound and visual scene pairs like human? In this paper, we propose a novel unsupervised algorithm to address the problem of localizing the sound source in visual scenes. A two-stream...
متن کاملFrequency of metamerism in natural scenes.
Estimates of the frequency of metameric surfaces, which appear the same to the eye under one illuminant but different under another, were obtained from 50 hyperspectral images of natural scenes. The degree of metamerism was specified with respect to a color-difference measure after allowing for full chromatic adaptation. The relative frequency of metameric pairs of surfaces, expressed as a prop...
متن کاملNatural scenes upset the visual applecart.
The effortless ease of everyday vision seems to contradict numerous findings on the limited capacity of visual attention. However, natural scenes appear to escape the stringent limitations of attention that apply to seemingly far simpler stimuli. This astonishing result will oblige us to rethink the nature of visual attention and its limited capacity.
متن کاملCortical Sensitivity to Visual Features in Natural Scenes
A central hypothesis concerning sensory processing is that the neuronal circuits are specifically adapted to represent natural stimuli efficiently. Here we show a novel effect in cortical coding of natural images. Using spike-triggered average or spike-triggered covariance analyses, we first identified the visual features selectively represented by each cortical neuron from its responses to nat...
متن کاملEffectively Leveraging Visual Context to Detect Texts in Natural Scenes
Detecting texts in natural scenes is challenging because of large variation in size and layout of texts and strong distractions from background clutters. Leveraging contextual information is crucial in boosting the detection accuracy. In this paper, we construct a conditional random field (CRF) to utilize visual context that helps enhance true detections and suppress false alarms. Unlike previo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Vision
سال: 2011
ISSN: 1534-7362
DOI: 10.1167/11.11.795